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1. Introduction 

This article is concerned with harmonic analysis and moment prob- 
lems as motivated by prediction theory and connections to analytic 
function theory and operator theory, continuing a tradition of clas- 
sic works such as Helson-Lowdenslager [12], Helson-Szego [13], and 
Wiener-Masani [23]. Most of these works are concerned with harmonic 
analysis on the unit circle and function theory on the unit disk. In this 
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article, we work in the setting of the two-torus or bi-circle. Helson- 
Lowdenslager [12] was perhaps the first paper to pin down which as- 
pects of harmonic analysis on the circle extend in a straightforward way 
to the bi-circle. Factorization of positive trigonometric polynomials is 
one area that most certainly does not extend in a straightforward way 
from one variable to two, and this topic serves as a good starting point 
to motivate the rest of the paper. 

The classical Fejer-Riesz lemma states that a non-negative trigono- 
metric polynomial t{9) in one variable can be factored as 

t{6) = W 6 )\ 2 

where p e C[z] is a polynomial with no zeros in the unit disk D = {z : 
\z\ < 1}. While this is one of the simplest factorization results it is use- 
ful in signal processing, trigonometric moment problems, and wavelets. 
It is also a prototype for more advanced and important factorization 
results, such as Szego's theorem. A simple degrees of freedom argu- 
ment shows that this result cannot be extended without conditions to 
two variables. In recent years, progress has been made in extending 
this result to two variables. First in Geronimo-Woerdeman [10] a char- 
acterization was given of positive bivariate trigonometric polynomials 
t that can be factored as 

(1.1) t(M) = b(e^)| 2 

2 

where p G C[z, w] is stable, i.e. has no zeros in the closed bidisk D = 
D x D. The characterization is in terms of trigonometric moments of 
the measure 

ddd(f) 

on [0,2tt] 2 , and the necessary and sufficient conditions for the charac- 
terization come from studying measures on T 2 = (<9D) x (<9D) of the 
form 

\dz\\dw\ 
(2n)*\p(z,wW 

where p is a polynomial. These are called Bernstein-Szego measures. 
In one variable, such measures play a natural role since they can be 
used to match a finite sequence of moments of a given positive Borel 
measure in an "entropy maximizing" way — see Landau [20] or Simon 
[23]. 

Surprisingly, the development of the above result passes through a 
sums of squares formula related to p which yields a famous inequality of 
Ando from multivariable operator theory and in turn yields Agler's Pick 
interpolation theorem for bounded analytic functions on the bidisk. 
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This connection is described in Cole-Wermer [7] and Knese [16]. The 
Hilbert space geometry approach of [H] made it possible to extend the 
characterization to the setting where p has no zeros on the open bidisk 
D 2 in [IS]. 

In another direction, an investigation was begun in [H] of orthogo- 
nal polynomials associated with bivariate measures supported on the 
bicircle constructed using the lexicographical or reverse lexicographical 
ordering and the recurrence formulas associated with these polynomi- 
als were developed. As in the one variable spectral theory type 
result was proved relating the vanishing of certain coefficients in the 
recurrence formulas to the existence of a Fejer-Riesz type factorization 
and of a Bernstein-Szego measure with p a stable polynomial. Re- 
cently in [8] , this viewpoint yielded extensions of the above results to 
the problem of characterizing positive bivariate trigonometric polyno- 
mials that can be factored as in (II. ip where now p G C[z, w] has no 
zeros on a closed face of the bidisk. A closed face of the bidisk refers 
to either T x © or © x T. This result is significantly more difficult 
because much of the analyticity of 1/p is lost. However the moments 
can still be computed using the one variable residue theorem. Further- 
more the factorization is in general not of the Helson-Lowdenslager 
type [12] which would give a rational function rather than a polyno- 
mial factorization. Special consideration was given when the trigono- 
metric polynomial t(9,(j)) = \p(z,w)\ 2 = \q(z, w)\ 2 where p(z, w) ^ 
for \z\ = 1, \w\ < 1 whereas q(z,w) ^ for \w\ = 1, \z\ < 1, for 
which a spectral theory result analogous to the characterization of the 
Bernstein-Szego measures on the circle was shown to hold. 

In this article we refine, extend, and give a more complete picture 
of the results in jH]; the case where p has no zeros on a closed face of 
the bidisk mentioned above. In particular we emphasize that positive 
linear forms T on bivariate Laurent polynomials of bounded degree 
which can be represented as a Bernstein-Szego measure as above with 
p(z, w) 7^ for \z\ = 1, \w\ < 1 can be characterized in two differ- 
ent ways by using T to define an inner product on polynomials: (1) 
a matrix condition involving certain natural truncated shift operators, 
and (2) the existence of a special orthogonal decomposition of spaces of 
polynomials we call the split-shift orthogonality condition. The "matrix 
condition" is easier to verify (and matches the condition presented in 
[8] when we choose an appropriate basis) while the "split-shift orthog- 
onality condition" provides more information about the geometry of 
the spaces involved as well as the polynomial p. In particular, this lat- 
ter condition is key to proving a generalization of the sums of squares 
formula alluded to above. A subtle fact is that the spaces involved in 
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the split-shift condition are in general not unique but are in one-to- 
one correspondence with Fejer-Riesz type factorizations of the positive 
trigonometric polynomial t. Furthermore given p we present an explicit 
description of these spaces in terms of the decomposition of p(z, 0) as 
a product of stable and unstable factors. This makes it possible to 
characterize a whole stratification of factorizations of t as \p\ 2 where p 
has no zeros on T x D and p(z, 0) has a specified number of zeros in 
D. The case where p(z, 0) has no zeros in D recovers the Geronimo- 
Woerdeman characterization result, and the case where p(z, 0) has all 
zeros in D results in a related characterization of when t = \p\ 2 where p 
has no zeros in (C\D) x D. In between these two extremes we can char- 
acterize when t = \p\ 2 where the zero set of p in D x C has a specified 
number of sheets over z G sitting in D x D (and a complementary 
number of sheets sitting in D x (C \ D)). 

We proceed as follows. In Section [2] we introduce the notation used 
throughout the paper and state the main theorems. In Section [3] we 
derive basic orthogonality relations associated with Bernstein-Szego 
measures. In Section [4] we show that a Bernstein-Szego measure with 
p(z, w) 7^ for \z\ = 1, | it; | < 1 implies the split-shift condition using 
a decomposition of p(z, 0) into stable and unstable factors. In Sec- 
tion [5] we show that the split-shift condition implies the existence of 
a Bernstein-Szego measure of the type given above. Next in Section 
[6] the matrix condition mentioned above is shown to be equivalent to 
the split-shift condition. In Section [7] we describe all p that give rise to 
the same positive bivariate trigonometric polynomial. In Section [S] we 
show how to construct p from the moments associated with the posi- 
tive linear form. In Section [9] we apply the previous results to solve the 
problem when an extended bivariate autoregressive model has a causal 
or acausal solution. Also in this section we give necessary and suffi- 
cient conditions in terms of moments when a bivariate Borel measure 
supported on the bicircle is a Bernstein-Szego measure with p nonzero 
for \z\ — 1, < 1. Finally in Section [TD] we adapt ideas from [TH] 
to give a second proof of our generalized sum of squares formula which 
should be of independent interest, while we also consider "generalized 
distinguished varieties" and apply an argument of adapted from [17] to 
obtain a sum of squares formula for polynomials associated with these 
varieties. This allows us to obtain a determinantal representation of 
the polynomial giving rise to the variety. Distinguished varieties were 
introduced in [2] and play an important role in multivariable operator 
theory and function theory on the bidisk. Our determinantal represen- 
tation generalizes one of the main theorems of 
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2. Notation and statement of results 
We denote spaces of Laurent polynomials by 

C jik = \f{z s w l : -j < s < j,-k <t < k}, 

where V denotes the complex linear span of a set, and we denote spaces 
of polynomials by 

-p j k ■= y{ z s w t : < s < j, < t < k}, 

where j, k G Z + . In some parts of the paper, the spaces Cj jk and Vj t k 
appear naturally within the context of the Hilbert space L 2 (T 2 ,/i), 
where \i is a positive Borel measure on T 2 , in which case we may use 
also j, k — oo by considering the closed linear spans above. 
A linear form T : £ n , m — > C is said to be positive if 

T(f(z,w)f(l/z,l/w))>0 



for every nonzero / G V n , m , where f(z, w) = f(z, w). With T we define 
an inner product on the space V njm , via 

(/, 9)t = T(f(z, w)g(l/z, l/w)) f,ge V n , m . 

Let us write %t for the finite dimensional Hilbert space (V n ,m, ('i')t)- 
For (k, I) G Z 2 ^ where the inner product above is defined we denote 
the following orthogonal complements: 

(2.1) £l tl = V k ,iewV k ,i-i, 

(2.2) ^ = ^eVi. 

(2.3) £h = V Kl e zV k - X u 



(2-4) H,i = n,i e V, 

We will often employ the anti-unitary reflection operator • 

g(z,w) f-> g(z,w) := z k w l g(l/z, l/w) 

which in this case we say is applied at the degree (k,l). This degree 
will usually be clear from context or explicitly stated. For example, 
applying this operator at degree (k, I) to the spaces T\ l and J = 1 l we 
see that 

fl _ ^1 ar|r j C2 _ ^2 
°k,l — •> k,l dllu °k,l — •> k,h 

since the operator is an anti-unitary in Hq-. 

Definition 2.1. A positive linear form T on C n ^ m satisfies the split- 
shift orthogonality condition if there exist subspaces of polynomials 
/Ci, /C 2 C Hf such that 

(1) £l_ lim = /Cx © K 2 

(2) K\ _L zKi and 
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(3) K u zK2<z£l m . 

The point of conditions (2) and (3) is that they imply K\ © zK>2 C 
S^m- This condition actually characterizes positive linear forms com- 
ing from a Bernstein- Szego measure. What is interesting is that this 
condition can also be expressed using a simple matrix condition. 

To present the matrix condition let us define three operators 

A = P ^lm-i Mz : £ n-l,m -> w8 n,m-l 
T = P£l_^ m M z : £n-l, m — > £n-l,m 

where M z is multiplication by z and Py_ represents orthogonal projec- 
tion onto a subspace H C Hq-. Notice that T is just truncation of 
multiplication by z to S^-i m- 

Theorem 2.2. Let T be a positive linear form on L n<m . The following 
are equivalent. 

(1) (Bernstein- Szego condition) There exists p G C[z, w] with no 
zeros on T x D and degree at most (n, m) such that 

(2.5) T{zPw k )= [ z>w k [f ll f wl \j\<n,\k\<m. 

J T 2 (2tiY\p{z,w)\ 2 

(2) (Split-shift condition) T satisfies the split-shift orthogonality 
condition. 

(3) (Matrix condition) The invariant subspace ofT generated by the 
range of B is contained in the kernel of A. More concretely, 

(2.6) AT j B = for j = 0, 1, . . . ,n - 1. 

This theorem is a more geometric formulation of the results in Geronimo- 
Iliev [8]. In particular, the coordinate free formulation of condition (3) 
makes it possible to give a straightforward proof of the equivalence of 
(2) and (3) in Propositions l6.ll and l6.2j — the original proof in [8] involves 
some non-trivial linear algebra. Of greater significance, however, is our 
emphasis on the split-shift condition and the rather complete knowledge 
it provides of the geometry of Bernstein- Szego measures of the above 
type. A version of the split-shift condition was recognized as an im- 
portant stepping stone in [S], but at that time it was not clear how to 
construct the spaces involved directly with Hilbert space geometry — 
this question was explicitly raised as [SJ Remark 5.3]. The approach 
developed here resolves this. 



POLYNOMIALS WITH NO ZEROS ON A FACE OF THE BIDISK 



7 



Theorem 2.3. Let T be a positive linear form on C n ^ m satisfying the 
Bernstein-Szego condition of Theorem \2. 6 A with polynomial p(z, w) hav- 
ing no zeros on T x D and degree at most (n,m). Then 

JCi = P £ i_ im {g(z)a(z) : g G C[z),degg < degb} 
K 2 = P e x_ xm {g{z)b(z) : g G C[z],degg <n- degb}, 

satisfy the split-shift condition. Here p(z,0) = a(z)b(z) where a G C[z] 
has no zeros in D and b £ C[z] has all zeros in D. 

More explicitly, if we form the span of the following projections of 
one variable polynomials 

z l a{z) - PwVn-i^aiz) for < i < degb 

and 

z { b(z) - P wVn _ Um z l b(z) for < % < n - degb 

then the resulting subspaces satisfy all the orthogonality conditions in 
the split-shift definition. 

Why should we emphasize the abstract looking split-shift condition 
in the first place? One answer to this is that the spaces in the split-shift 
condition appear naturally in the following sum of (hermitian) squares 
result that ends up being an important by-product of our work. 

Theorem 2.4. Suppose p G C[z, w] has no zeros on T x D and degp = 
(n,m). Define p(z,w) = z n w m p(l/z,l/w). Then, there exist polyno- 
mials Ai, . . . , A m , B>x, . . . , B ni , Ci, . . . , C n2 G C[z, w] such that 

\p(z,w)\ 2 - \p(z,w)\ 2 

m / ni ri2 

= (i - M 2 ) E \M^)\ 2 + (i - l*D E \ B >( z > w )\ 2 - E w )\ 2 

j=l \j=l j=l 

where n 2 is the number of zeros of p(z, 0) in D and n\ = n — n 2 . The 
same result holds if p has no zeros inTxB and no factors in common 
with p. 

The different sums of squares terms can be constructed from impor- 
tant subspaces of L 2 ^ dz ^ 2 w ^ ) : the Aj form an orthonormal basis of 

£nm-i-> t ne Bj form an orthonormal basis of (the reflection of /C2), 
and the Cj form an orthonormal basis of JCi. (See Theorem 15.51 ) This 
formula illustrates how natural are the spaces in the split-shift condi- 
tion, and it also reproves some important formulas as special cases. 

When n 2 = 0, p is stable and we get the Cole-Wermer type of sum 
of squares formula [7] which can be used to prove Agler's Pick interpo- 
lation theorem on the bidisk; see also [TO] , [T6] . [9], [19], [6]. The exact 
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numbers of squares involved in this case turned out to be important in 
recent work on extending Lowner's theory of matrix monotone func- 
tions to two variables in Agler-McCarthy- Young [5]. When m = (i.e. 
p does not depend on w) we get a decomposition which readily implies 
part of the Schur-Cohn method for counting the roots of a polynomial 
inside and outside the unit circle. 

The case of p with merely no zeros on T x D can be derived from 
a limiting argument as in (T5J. We give a second proof of the sum of 
squares formula using ideas of Kummert [TjJ| in Section flU.il This proof 
should be of independent interest and has the advantage of working di- 
rectly for all cases. See Section ["l(J.2l for an application of the formula to 
proving a determinantal representation for a class of curves generalizing 
the distinguished varieties of Agler-McCarthy [2] . 

Now that we see that the split-shift condition is natural, we get 
into a deeper discussion of Theorem I2.2[ and its extensions. From the 
maximum entropy principle there are at most finitely many p(z, w) for 
which the Bernstein- Szego condition holds. Theorem 12 . 31 therefore gives 
one particular way to construct /Ci,/C 2 in the definition of split-shift 
and this way is uniquely determined by the choice of p. However, since 
a trigonometric polynomial factored as \p\ 2 can potentially be factored 
in more than one such way — roughly speaking these polynomials can be 
obtained from one another by permuting the factors in \p(z, w)\ 2 which 
depend only on z — each such factorization will yield spaces as in the 
split-shift condition via the above theorem. We prove that these are 
all the possible split-shift decompositions corresponding to \p(z,w)\ 2 . 
See Proposition 17.31 

While each choice of p in the factorization of t = \p\ 2 yields a canon- 
ically associated pair of spaces /Ci,/C 2 i n the split-shift condition, the 
matrix condition naturally gives rise to two canonical choices for such 
pairs. 

Theorem 2.5. Let T be a positive linear form on L n)Tn satisfying the 
matrix condition of Theorem \2.2\ Then, (/Ci,/C 2 ) = (S^_ lm Q B,B) 
satisfies the split-shift condition where 

B = V{T>Bf : / G wFl^j = 0, 1, . . . }. 

Similarly, (/Ci,/C 2 ) = (A, £n-i,m © A) satisfies the split-shift condi- 
tion where 

A = V{(T*yA*f : / G wEl^j = 0, 1, . . . }. 

If (/C'^/C,) is any other pair satisfying the split-shift condition, then 
Ac K! x andB c K! 2 . 
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To be clear, T* : E,\_ x m — > £^-1 m * s gi ven by M\/ z and A* : 

i — * £n-i,m 1S gi ven by Pe 1 M±/ g . See Theorems 17^4] and 17J3] 
where we also show how the spaces in Theorem 12.51 relate to those in 
Theorem 12.31 

Theorems 12.31 and 12.51 directly show how the Bernstein-Szego con- 
dition and the matrix condition yield the split-shift condition. On 
the other hand, if the split-shift condition holds, /Ci © zK.2 has co- 
dimension one in E\ m and we shall show that the Bernstein-Szego 
condition holds using any unit norm element p in the one dimensional 
space Snm © (^i ® zK<i). A sum of squares result related to Theorem 
12.41 ends up being crucial here. In Section [HI we describe a simple pro- 
cedure for constructing p from the moments T{z : 'w k ) once we know the 
split-shift condition holds. 

Our emphasis on the split-shift condition permits several interesting 
refinements that were not evident before. Notice that if p does not 
vanish on T x D, then the argument principle shows that the number 
of zeros of p(-,w) in D will be constant as w varies in D. Thus it 
is possible to prove a "stratified" version of Theorem 12.21 where we 
characterize factorizations involving p with no zeros in T x D such that 
p(z, 0) has a specified number of zeros in D. See the end of Section [7| 
for the proof of the following corollary. 

Corollary 2.6. Let T be a positive linear form on C n>m and let < 
d < n. The following are equivalent. 

(1) (Bernstein-Szego condition) There exists p G C[z, w] with no 
zeros on T x O, degree at most (n,m), and where p(z,0) has d 
zeros in D such that 

T(z^) = [ z>w k \ d 2 ff wl \j\ < n, |*| < m. 

(2) (Split-shift condition) T satisfies the split-shift orthogonality 
condition where 1C\ has dimension d. 

(3) (Matrix condition) The invariant subspace of T generated by 
the range of B is contained in the kernel of A, and 

dim A < d < n — dim B. 

Note A and B are as in Theorem \2.5\ 

In particular, the case d = yields the Geronimo-Woerdeman result 
(as well as much simpler looking conditions). In this case, the split-shift 
condition merely says 

(2-7) z8 n _ l m C S n m . 
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The matrix condition in this case implies A = {0} which implies A = 0. 
Since the range of A is P w£ 2 x z ^n-\,mi ^ s means w ^n,m-i -L z ^n-i,rm 
which is equivalent to (12.71) because of the orthogonal decomposition 

By performing the reflection operation, w£ 2 m _ x _L zE] l _ Xm is equiva- 
lent to J-~n-l,m -L ^n,m-X- 

Corollary 2.7 (Geronimo-Woerdeman [ID]). LetT be a positive linear 

2 

/orm on £ n>m . There exists p 6 C[z, w] with no zeros on D and degree 



at most [n, no) such that 



TfrV) = I ^ J^l \j\<n Ak \<m, 



if and only if 

pi | -pi 

u n—l,m ^ n,m—l' 



To use the language of [2TJ, the last condition can be neatly phrased 
as saying V n -i, m and V n ,m-i intersect at right angles. 

As in [8], Theorem 12.21 allows us to characterize when a positive two 
variable trigonometric polynomial can be factored as \p(z,w)\ 2 on T 2 
where p has no zeros in T x D. 

Theorem 2.8. Suppose t(z,w) = *YTi=-nY^k=- m ^j^w k > for(z,w) G 
T 2 . Then, there exists p G C[2, i£>] of degree at most (n,m) with no ze- 
ros on T x IB) such that t = \p\ 2 on T 2 if and only if the positive linear 
form T on £„ im 

T(z>w k ) = I ^ y 1 \j\ < n, \k\ < m 

satisfies the split-shift condition. 

See the end of Section [5] for a proof of this theorem. 

We say a finite, positive Borel measure \x on T 2 is non- degenerate if 

|/| 2 a>>0, 

for every nonzero polynomial / G C[z, w\. We next turn to the problem 
of characterizing which such measures fi on T 2 are of the form 

(2-8) ±* 

where p G C[z, w] has no zeros in T x D and degree at most (n, m); da 
denotes normalized Lebesgue measure on T 2 . 



POLYNOMIALS WITH NO ZEROS ON A FACE OF THE BIDISK 



11 



Some necessary conditions turn out to be 

(2-9) £n,M = ^n+j,M 

for M > m — 1 and j > 0. These conditions are most likely not 
sufficient though. 

Surprisingly, in j8], it was noticed that conditions (12. 9 p combined 
with the analogous conditions obtained by interchanging the roles of z 
and w characterize when fi has the form 

\p(z,w)q(l/z,w)\ 2da 

2 

where q G C[z, w] have no zeros in D . 

We now provide the following necessary and sufficient conditions for 
fi to have the form (12.81) . Define the following one dimensional spaces 
(2-10) 

Um ■= V 2nM y{z'w k : < j < 2n, < k < M, (j, k) ± (n, 0)}. 

Theorem 2.9. Let d[i be a non- degenerate, finite, positive Borel mea- 
sure on T 2 . There exists p G C[z, w] of degree at most (n,m) with no 
zeros on T x © such that 

da 

fi = W 

if and only if 

for M > m — 1 and j > . 

This theorem is proved in Section 19.21 and in 19.31 it is expressed 
concretely in terms of the moments of [i. In Section |9~TI we discuss the 
close connection of our main theorem, Theorem 12.21 to autoregressive 
filters as was done in |10j . 

3. Basic orthogonalities of Bernstein-Szego measures 

The next two sections are occupied with proving that the Bernstein- 
Szego condition implies the split-shift condition in Theorem 12. 1\ which 
is the content of Theorem 14.81 The approach is an extension of [9] . 

Let p G C[z, w] and assume p(z, w) ^ for (z,w) G T x D. Let 
degp < (n,m), p(z,w) = z n w m p(l/z,l/w). Let da denote normal- 
ized Lebesgue measure on T 2 . We use da(z) = \dz\/ (2tt) or da{w) = 
\dw\/(2ir) to denote normalized Lebesgue measure on T using the vari- 
able z or w. We use (•, •) for the inner product in L 2 (l/\p\ 2 da, T 2 ) and 
V to denote closed linear span in both L 2 (T 2 ) and L 2 (l/\p\ 2 da). This 
is legitimate as L 2 {l/\p\ 2 da) is homeomorphic to L 2 (T 2 ) since \p\ is 
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bounded above and below on T 2 . The next lemma shows that p and p 
are orthogonal to all monomials in half planes. 

Lemma 3.1. In ^(^da), 

p _L z j w k 

for j G Z, k > 1 and 

p _L z j w h 

for j G Z, fc < m. yl/so, 

V-{Vp : j G Z} = V{^w fc :j'6Z,0<K m}9V{2 J M) fc : j G Z, 1 < k < m}. 
Proof. 

(z j w\p) = [ z j [ W J o-(w)da(z) = 
Jt JjP{z,w) 

for k > 1 since l/p(z,w) is holomorphic in w G D when z G T. The 
proof for p is similar. 

For the final part, we have just shown the inclusion C. On the other 
hand, if / G \f{z j w k : j G Z, < k < m}Q\/{z j w k : j G Z, 1 < fc < m} 
and / _L z^p for all j G Z, then 

for all j G Z implies /(z, 0)/p(2;, 0) = for a.e. z G T. (Note 
that f(z, 0) should be interpreted as /(j; O)^ i n -^ 2 -) Therefore, 

/(z, 0) = which implies / G Vjz- 5 w fc : j 6 Z, 1 < fc < m} making / 
orthogonal to itself. So, / = 0. □ 

Define 



J v (z,w) = z^ w)p{irZ ^ 



WT] 



H 71 (z,w) = z^ w)pil/ ^ r]) . 

1 — wr) 

By the previous lemma, H v _L V-fV'it;* : j G Z, < m} for 77 G D and 
J,, 1 V{z j w k : j G Z, fc > 0} for |r/| > 1 since for (z, w) G T 2 



?• »A - ~ W ^ P(z,w)p(l/z,r]) 

Jn{Z, W ) — _ Z _ ._ 

7] 1 — w/rj 



Define 



LJz,w) = L{z,w\rj) = z = = J v (z,w)—H v (z 

1 — W7] 
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which is a polynomial in (z,w,fj) of degree (2n,m — l,m— 1). Notice 
that 



(3.1) z^iwftT^Lil/z, 1/w; 1/fj) = fj^Ly^z, w) = L v (z, w). 

Similarly we define 

G v (z, w) = G(z, w; rf) = J v (z, w) — wf]H v (z, w) 

which is a polynomial in (z,w,fj) of degree (2n,m,m). Note that the 
reflection symmetry for L v (z,w) implies the following symmetry for 
G v (z,w) 



(3.2) z 2n (wfj) m G 1/fj (l/z, 1/w) = G v (z, w). 

Up to factors of z n , L v (z,w) and G v (z,w) are parametrized one- 
variable Christoffel-Darboux kernels. In the next few lemmas, the 
orthogonality properties of p and p are used to obtain orthogonality 
properties on pieces of these kernels. 

Lemma 3.2. If f £ L 2 and supp(f) C Z x Z + , then in L 2 (l/\p\ 2 da) 

k>0 

for 7] G D. In particular, J v _L V{z-'w k : k > 0, j ^ n} for 77 G D. 
Proof. 

<f,4>=ff 

J J T 2 p{z,w) l-wq 
lp^\p( z ^)z n da{z) 

J2f(n,k) V k . 

k>0 



□ 



Lemma 3.3. In L 2 (l/\p\ 2 da), for all n G C 

^ n,0 < k < m} 
and for f G V-jVu> fc : j G Z, < k < m} 



m—l 



(/, A,) = £ /(*,*)»?*■ 



k=0 



Proof. Note L v = - H v . As / _L H v and (/, J v ) = J2T=o f( n > % > 
we see that the desired formula holds for rj G D. Since both sides are 
polynomials in 77, the formula holds for all rj G C. □ 
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Corollary 3.4. In L 2 (l/\p\ 2 da) 

V{L V :i)GB} =V 2n ,m-l (Pn-l,m-l V Z^Vn-i^i) 

= V {z j w k :jeZ,0<k<m} 

e y{z j w k : j 7^ n, < k < m}. 

Proof. We have already shown the L^'s are in the orthogonal comple- 
ments on the right. On the other hand, if any / (in either orthogonal 
complement space) is orthogonal to L v for all t], then f(n, k) = for 
< k < m, implying / = 0. □ 

The next Proposition (see also Corollary 13.71) shows that certain 
orthogonal subspaces are mapped into each other by multiplication by 
z. 

Proposition 3.5. In L 2 (l/\p\ 2 da) 

Poo.m-l V n -^ m -l = V{z j L ri : j > 0, T) G D} 

= V {z j w k : j E Z, < k < m} V{z J w k : j < n, < k < m} 

Poo.m V n -l,m = V{z J G v :j>0,))GD} 

= V {z j w k : j e Z, < k < m} V{^ w fe : j < n, < k < m}. 

Proof. By the Corollary, z^L v is in the orthogonal complement spaces 
for all j > 0, r) G D. On the other hand, if anything in these orthogonal 
complements is orthogonal to z^L n for all j > 0, 77 G D, then such an 
element will have no Fourier support in the set {(j, k) : j > n, < k < 
m} and will be orthogonal to itself. 

We get similar decompositions when we use G v instead of L v and 
allow k = m. □ 

If we apply the anti-unitary reflection operation ^ at degree (n — 
l,m — l) we get other useful decompositions 

(3.3) 

V {z j w k : j < n, < k < m} Q V n ^ m ^ x = V{z j ~ n L v : j < 0, rj G D} 
= y{z J w k :jeZ,0<k<m}e y{z j w k :j>0,0<k<m}. 

Multiplication of the above equation by z gives the following important 
consequence which provides necessary conditions for the full measure 
characterization in Section 

Corollary 3.6. In L 2 {I / \p\ 2 da) , 
for all M > m — 1 . 
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We get this for all M > m — 1 simply because we can view p as a 
polynomial of degree at most (n, M+ 1) for any M > m — 1. Similarly, 
we obtain the following corollary. 

Corollary 3.7. In L 2 (I / \p\ 2 da) , 
for all M > m — 1 . 



4. BERNSTEIN-SZEGO CONDITION IMPLIES SPLIT-SHIFT CONDITION 

Using the same setup as the previous section, we now delve into 
the more refined orthogonalities necessary to prove that the Bernstein- 
Szego condition implies the split-shift condition in Theorem 12.21 Write 
p(z, 0) = a(z)b(z) where a has no zeros in D and b has all zeros in D. 
Let p := degfr and %{z) = zPb(\/z). 

Lemma 4.1. In L 2 (l/\p\ 2 da), for rj G D, we have 



az 3 _L wJ-rj 



for all j < (3. 
For \rj\ > 1, 



w m bz 3 1 H v 



for all j < n — (3. 

Proof. Observe that for j < (3 and < \q\ < 1, {az 3 ,wJ n ) equals 

a(z)z 3 wzMz,v) dw_ da[z)= f ^ a{z)p{z ^ f dw } 



2 p(z,w) 1 — wr] 2iriw J ' Jj 2-rrip(z, w)(w — rj)w 

z 3 - n a(z)p(z,ri) ( — - ) da 



VJt b{z) 

since n — j > deg a. The proof is easier when rj = 0. 
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For \r)\ > 1, (w m bzi,H rj ) equals 



Jt Jt 



for n — (3 > j . □ 

Lemma 4.2. In L 2 (1 / \p\ 2 da) , if f G VjV : j > 0} ; then f J_ z k p for 
all k > if and only if f(z) = a(z)q(z) where q G C[z] has degree less 
than p. 

If f G \/{w m z^ : j > 0}, i/ien / _L z fc p /or all k > if and only if 
f(z,w) = w m b{z)q{z) where q G C[z] /ias degree less than n — (3. 

Proof. If z k p J_ f G V{z J ' : j > 0} for all fc > then 

/(*) ^..w.^- /"**/(*) 



for all fc > implies f(z)/p(z, 0) = z#0) for o G # 2 (T) = V{z J ' : j > 
0}. Then, f(z) = p(z,0)zg(z) and so 



z n - l f{z) = z n p{z^)g{z)eH 2 

implies f(z) is a polynomial of degree at most n — 1. In addition, 
/ = a6o implies a divides /. (We reflect b at degree (3 and a at degree 
n — (3.) So, f = ah where h G C[z] has degree less than (3. Finally, 
/ = aq where q E C[z] has degree less than (3. 
For the converse, let / = aq. Then, 

~ zkf{z) -da( W )da(z) = I ^4da(z) = [ ^da(z) 



T 2p(z,w) J T p(z,0) J T b(z 



Zk+ " q{z) -da{z) = I do-(z) = 



t b(z) Jt b{z) 

for all k > 0. 

The proof of the second part is very similar. □ 
Define 

• ^oo,m © co,m—\ 
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Notice 

V{z j p : j > 0} C K 

and 

W{z j p : j > 0} C C. 

Let Pq denote orthogonal projection onto Poo,m-i and Pq = I — P$. 
Let Pi denote orthogonal projection onto wVoo, m -i and P± = I — Pi- 
The next two lemmas and corollary construct the spaces fCi and JC2 
in the split-shift condition in Definition 12.11 

Lemma 4.3. In L 2 (l/\p\ 2 da) 

JC V{z j p : j > 0} = Pt(V{az j ■ < j < P}) 

C y{z j p : j > 0} = P - L (V{w m fe j : < j < n - /?}). 

Proo/. Let f E JC and / 1 z j p for all j > 0. Write f{z, w) = f(z, 0) - 
wg(z,w) where g G Poo,m-i and notice that Pif = = Pi(f(z,0)) — 
wg(z,w) sothat f = f(z,0)-P 1 f(z,0) = P L ± /M). Since Pi/(*, 0) 1 
for all j > 0, we see that f(z, 0) _L z-*p for all j > 0. By Lemma 
14.21 f(z, 0) = a(z)g(z) where degg < /3. This shows the inclusion C. 

On the other hand, P^az^) = az^ — P\az^ G /C, P\[az^ _L for 
all > 0, and az^ _L z k p for all > by Lemma [4.21 

The second equation has a similar proof. 

□ 

Lemma 4.4. In L 2 (l/\p\ 2 da) 

P^{y{az ] : < j < $}) C P n _i )OT 

P ± (V{ W m W : < j < n - P}) C 7Vi,m. 

Proof. For 0<j<P, P^{az j ) = az j -P 1 {az i ). Clearly, az J ' G P„_i, m , 
so the main thing to show is that P\{az^) G P„,_i, m . 
For fc > 0, 77 G © 

(Pi(az : >),wz k L ri ) = {az\wz k L v ) since wz k L r) G wVoo, m -i 

= (az j ~ k ,wL ri ) 

= (az^~ k ,wJ ri ) since a2; J_fc _L wP,, when |r/| < 1 
= 

by Lemma [4.11 On the other hand, since / = wPi(az^) is an element 

of Poo,m— 1 

m— 1 

(Px^),™^) = (ML„> = ]T /(n + k,t)rf = 

i=0 
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by Lemma 13. 31 So, f(J, k) = for j > n and k £ Z. This shows 
Pi(az J ) =wfe V n -l,m- 

For the second inclusion, the main thing to show is f — Po(w m bz :J ) £ 
V n -i,m for < j < n - ft. For r] £ C, 

m— 1 

(/, z*A,> = £ /(n + fc, *)T7* 

on one hand, while for \rj\ > 1 

(f,z k L v ) = (w m bz j ,z k L v ) 

= (w m z j ~ k b, —H n ) since J ri _L w m z j ~ k b when |r/| > 1 
= 

by Lemma 14.11 since j — k < n — (3. This implies /(n + k,t) =0 for 
> and < i < m as desired. 

□ 

Set 

K, x =Ke V{z j p : j > 0}. 

d = ce y{z j p-. j > o}. 

Corollary 4.5. In L 2 (l/|p| 2 rfa) 

^ = ^i,J V ^ : < j < /?}) C ^ 
A = P^(V{« : < j < n - /?}) C 
£ = P £ i_ i m (V{&^ : < j < n — /?}) C < jm . 

Proof. By Lemmas 14.31 and 14.41 /Ci is contained in both m and 
£* m . Let Pn-im-i denote orthogonal projection onto wV n -\,m-\- For 
any / = az^ — Pi{az^) we know / £ V n -i, m for < j < (3, and so we 
see that P\{az^) = P^-^ m _ 1 P\{az^) = P^-i m -i( az ^)- Therefore, / = 
az j - Pn-i, m -i(az j ) = P £ i_ im {azi), which proves /Q = P £ i_ im iy{az j : 
0< j </?})■ 

The second set of equations has a similar proof. The last set of 
equations follows from the second set by taking the reflection operation 
^ at the degree (n — 1, m). □ 

Let JC2 := £n-im © ^1 so that 
Similarly, define £2 so that 
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The next lemma gives a different characterization of the spaces /C2 
and £ 2 . 

Lemma 4.6. In L 2 (l/\p\ 2 da) 

(4.1) V {z j p : j > 0} © V{wz j L v : j > 0, 77 G D} 

= £ 2 © V{^G„ : j >0,i|Gl} 

(4.2) V {z j p : j > 0} © V{V L„ :j>0,i)GD} 

= £ 2 ®V{« , 'G,:j > 0,r/G©}. 

Proof. Now, 

T~*oo,m © w'Pn— 1,771— 1 (^Poo,m © ^'Poo, m-l) © ^(^00 ,m— 1 © Pn-l,m-l) 

=/C © V{wz j L v : j > 0, 77 G D} 
=/Ci © V-fy'p : j > 0} © V{wz j L v : j > 0, 77 G D} 
by Proposition 13.51 The same set is equal to 

4-i,m © V{^G„ : j > 0, 77 G O} = /Ci © £ 2 © V^'G, : J > 0, 77 G D} 

and after canceling /Ci we obtain 04. ip . The proof for £ 2 follows along 
the same lines by considering "Pn-i.m-i- D 

Lemma 4.7. In L 2 (l/\p\ 2 da) 

(4.3) V {Vp : j < 0} © VjW'^L,, : j < 0, 77 G D} 

= £ 2 © V{^~ n G„ : j < 0, 77 G D} 
where C2 is obtained by reflecting C2 at degree (n — l,m) and 
(4.4) 

V-{Vp : j G Z}©V{w2^ n L r? : j < 0, 77 G B}®V{wz J L v : j > 0, 77 G O} 

= V{^- B G„ : j < 0, 77 G B} © V{^G„ : j > 0, 77 G ©} © 4_ 1>rri . 

Proof. The first part follows from applying the reverse operation ^ at 
the degree (n — 1 , m) in (14.21) and by using (13.11) and (13. 2p . 
The second part comes from decomposing 

y{z j w k : j G Z, < jfe < m} © tu7Vi >w .-i 

in two different ways. By Proposition I3.5[ equation (13. 3p and Lemma 
13.11 it equals 

V {z j w k :jeZ,0<k<m}e V{z j w k : j G Z, 1 < k < m} 
© V{wz n - j L v : j < 0, 7] G D} © V{wz?L v : j > 0, 77 G D} 
= V {z j p : j G Z} © V{wz'- n L v : j < 0, 77 G D} © \Z{w&L v : j > 0, 77 G D}, 
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while it also equals the right hand side of (I4.4p . □ 

Theorem 4.8. Assume p G C[z, w] has no zeros inTxD and degree 
at most (n,m). In L 2 {\ / \p\ 2 da) , for 

Kx = P £U JV{azi-.0<j<P}) 
Ci = P^_ lm (y{bz^ :0<j<n-P}) 

we have 
and 

Consequently, the split-shift orthogonality condition holds for a positive 
linear form associated to a Bernstein-Szego measure with p having no 
zeros in T x D. 

Proof. By Corollary 14. 5[ it is enough to prove 

Ci = JC 2 

and 

The direct sum of the left sides of (14. ip and ( 14. 3D yields the left side 
of ( I4.4p . So, the direct sum of the corresponding right hand sides are 
equal which means 

/C 2 © y{z j G v : j > 0, G ©} © £ 2 © V{^- n G„ : j < 0, 77 G B} 
= V {^-"G, : j < 0, 77 G D} ® V{^G, : j > 0, rj G ©} © #_ 1>m . 

Therefore, E^_ x = /C 2 © £ 2 . But, o^_ l m = Fn-i,m = £1 © £2 and so 
/C 2 = £1. 

We know Cp, /Ci C m by Lemma 13.11 and Corollary 14.51 By defi- 
nition of K.\ we know JC\ _L p. 

Now, using Corollary 13.71 we see that 

y{z J w k :j>0,0<k<m}e V{z j w k : < j < n, 1 < k < m} 

decomposes into 

JC © Zw(Voo,m-l © V n -X,m-x) 

= V {z j p : j > 0} © JCi © V{zwz>L v : j > 0, 77 G D} 
=Cp © /Ci © ^(V{2 j p : j > 0}) © z{V{wz?Lr, : i > 0, 77 G D}) 
=Cp © /d © z(/C 2 © V{z j G v : j > 0, 77 G D}) by @3J 
=Cp © /d © ^/C 2 © z(V{^'G„ : j > 0, 77 G D}) 
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but it also decomposes into 

£ l n , m ®z(W{^G 11 :j>0,r ] eB}) 

and therefore 

£} nm = Qo©/Ci ®z!C 2 . 

□ 

5. Split-shift condition implies Bernstein-Szego condition 

The goal now is to prove that the split-shift condition (see Definition 
12. ip implies that T can be represented using a Bernstein-Szego measure 
whose associated polynomial has no zeros on T x D. 

We call the pair (/C l7 /C 2 ) a shift-split of E\ m . By dimensional consid- 
erations £n t m O (/Ci ©^2) will be one dimensional, and therefore of the 
form Cp for some unit norm p. We shall call p a split-poly associated 
to the shift-split. The point now will be to prove that a split-poly p 
has no zeros on T x © and along the way we will prove some interesting 
formulas for p (which will also give formulas for an arbitrary p with no 
zeros on T x D since we can apply our formulas to l/\p\ 2 da). 

There may be more than one shift-split of £^ m , but we shall see 
that each split-poly is associated to one shift-split. We will provide a 
description of all split-polys (and hence all shift-splits via the previous 
section) in Section [7J 

Let Kj k be the reproducing kernel for Vj t k in %t- Namely, for 
(C,rj) e C 2 , (K j>k )^ >v) (-, •) = K j)k (-,-;C,v) is the unique element of 
Vj t k such that 

(/, (A- ilfc )( C ,,)>r = /(C,»?) 

for all / e Vj, k . 

Remark 5.1. We shall use some standard facts about reproducing 
kernels of polynomials on T 2 . See Section 3 of [T6j . 

(1) The reproducing kernel of an orthogonal direct sum is the sum 
of the reproducing kernels. 

(2) Shifting a subspace by z (resp. w) multiplies the reproducing 
kernel by z( (resp. wfj). 

(3) The "reflection" ^ of a subspace "reflects" the reproducing ker- 
nel. 

On this last point, if H is a subspace of polynomials of degree at 
most (j, k) and H is its reproducing kernel, the subspace 

H := {ziw k f(l/z, 1/w) : / e H} 
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has reproducing kernel 

H(z, w; C, v) ■= (z() j (wfj) k H(l/(, 1/m 1/w). 

The degree (J, k) at which we reflect will either be mentioned explicitly 
or will be the maximal degree of the elements of the subspace. 

Using these manipulations we get the following formulas. 

Ej = Kj^ m — wfjKj tm -i = the reproducing kernel for £■ 

F- = E- = Kj jTn — Kj m _i = the reproducing kernel for J-^ 

E k = K n k — zC,K n _ lk = the reproducing kernel for £ 2 k 

F k = E\ = K n> k — K n -i t k = the reproducing kernel for T^ k . 

For example, the first formula follows from the orthogonal decomposi- 
tion 

We record some basic formulas which do not require any special or- 
thogonality conditions. In fact, they are just the result of manipulating 
the equations above. 

Lemma 5.2. We have 

Ej(z,w;C,v) ~ F j(z,w;C,v) = (1 - wfj)K jim -i(z, w; C,rj) 

El(z, w; C, v) ~ F k(z, w; (, rj) = (1 - z()K n ^ k (z, w; (, rj). 

If {E (z, w), . . . , E m (z, w)} is an orthonormal basis for £%,m then we 
write 

E 2 m (z, w) = (E (z, w),..., E m (z, w)) = (1, w, . . . , w m )E 2 m {z) 

for an appropriate (m + 1) x (m + 1) matrix polynomial E^z). Then, 

E 2 m (z, w- C, 77) = E 2 m (z, w)E 2 m ((, n)* = (1, w, . . . , w m )E 2 m (z)E 2 m (CY(l, v, ... , rj m )*. 

Lemma 5.3. The matrix polynomial E^(z) is invertible for all z£D. 

Proof. Suppose E^zq) is singular for some z G C and choose nonzero 
v G C m+1 such that E 2 m (z )v = 0. Then, 

/(*, w) = E 2 m (z, w)v = (1, w, . . . , w m )E 2 Jz)v 

is in £ 2 m , and f(z ,w) = for all w. So, f(z,w) = (z — z )g(z,w) for 
some g G P„-i, m . Since / _L zg we have 

ll/-^ll 2 = ll/ll 2 + lkll 2 = kol 2 lkll 2 . 

Then, Il/H 2 = |H| 2 = (\z Q \ 2 - l)\\g\\ 2 which implies |z | > 1- □ 
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Let K\ be the reproducing kernel for JCx, and let K 2 be the repro- 
ducing kernel for K 2 . When p has unit norm, the reproducing kernel 
for Cp is p(z, w)p((, rj) but we will simply write pp. 

Lemma 5.4. If(Kx,K 2 ) is a shift-split of E\ m with split-polyp then 

E\_ x = K l + K 2 

E l n = K l + zC,K 2 +pp 

Fn-l = Kx + K 2 

F^ = zck l + k 2 + ^ 

where the kernels K\ and K 2 are reflected at the degree (n — l,m). 

Proof. These all follow from Remark 15.11 and the definition of shift- 
split. □ 

Theorem 5.5. If (Ki, K 2 ) is a shift-split of E\ m with split-polyp then 

pp-pf=(l- wffiE^ + (1 - z()(K 2 - K x ) 

= (l- wr] )Fl_ 1 + {l-zQ{K 2 -k l ) 

= (1 - wf))Fl_ x + (1 - z(){K 2 - Kx) + (1 - z()(l - wfj)K n „x, m -i 

and 

pp — wfjpp = (1 — wff)E m + (1 — zQ(wfjK 2 — Ki) 
= (1 - wrj)Fl + (1 - z()(K 2 - wfjKi) 

= (1 - wf))Fl + (1 - z(){wfjk 2 - + (1 - <)(1 - wf\)K n -i. 
Proof. Combining Lemmas 15.21 and 15.41 we get 

zCiK, +K 2 - + K 2 )) = (1 - W^K^m-l 

and 

K x + z(K 2 +pp- [zC,kx + K 2 +pp) = (l- wff)K n ^x. 
Subtract these two formulas to get 

(1 - zC,){Kx -K 2 )+pp-w = (l- wff)E^_ 1 

which rearranges to get the first desired formula. Similar arguments 
give the remaining formulas. □ 

If T comes from a Bernstein-Szego measure l/\p\ 2 da where p has no 
zeros on T x D, then Theorem 14.81 implies p is a split-poly and then the 
above theorem immediately implies Theorem I2.4[ the sum of squares 
theorem from the introduction since reproducing kernels can be written 
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as a sum of squares of an orthonormal basis. In general, the sum of 
squares formula implies a split-poly has no zeros on T x D. 

Corollary 5.6. If p is a split-poly, then p has no zeros on T x D. 

Proof. We use the second set of formulas in Theorem 15.51 Suppose 
p(z, w) = for some (z, «j)gTxD. Setting z = (GT and w = r\ G D 
we have 

\p(z, w)\ 2 — \w\ 2 \p(z, w )| 2 = —\wp(z, w)\ 2 = (1 — {w^E^z, w; z, w) > 

which shows wp(z,w) = 0. Then, for arbitrary 77 G C we have 

= E 2 m (z, w- z, 77) = E 2 m (z, w)E 2 m (z, r))* = (1, w, . . . , w; m )^(z)^(z)*(l, 77, . . . , V m ) 

which implies 

0=(l,w,...,w m )E 2 m ( Z )E 2 m (zy 
contradicting the fact that E^z) is invertible from Lemma [5.31 □ 

We can now prove the split-shift orthogonality condition implies the 
Bernstein-Szego condition in Theorem 12.21 

Corollary 5.7. Suppose two positive linear forms T\ andTi both satisfy 
the split-shift condition with the same split-poly p. Then, T\ = %, and 
the linear forms agree with the linear form associated with the measure 
l/\p\ 2 da. 

Proof. It is enough to show the reproducing kernels K n>m are the same 
for both forms. We can form a matrix polynomial E^z) corresponding 
to each form 7i and T2, say E\(z) and E 2 (z) (just in this proof; we will 
not use this notation elsewhere). Using Theorem 15.51 for 2 = (6T and 
arbitrary w,r] G C we get E 1 (z)E 1 (z)* = E 2 (z)E 2 (z)* for z G T using 
arguments similar to the previous proof. Then, 

E 2 -\z)E 1 {z) = E 2 {l/z) t E^ 1 {l/z) t . 

By Lemma |5~3"| the left hand side is analytic for \z\ < 1, while the right 
hand side is analytic for \z\ > 1. By Liouville's theorem E 2 1 (z)Ei(z) = 
V is a constant unitary matrix. This in turn implies the reproducing 
kernels E^ZjW; (,rj) for Ti, T 2 are the same. By the next lemma, we 
may conclude that T\ = T 2 . □ 

Lemma 5.8. The inner product in Hf is determined by the reproduc- 
ing kernel E 2 ^. 

Proof. Notice F^(z,w;C,v) = (<) n M) m ^(l/C, 1/w l/z, l/w). So, 
E^ determines F^. By Lemma 15.21 E^ determines i^„_i im and since 
K n ^ m = E^ + z(K n _i jUl we see that E^ determines K n ^ m as well. □ 

We can now prove Theorem 12.81 
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Proof of Theorem \2.8\ . We already know that if t = \p\ 2 then the split- 
shift condition holds. On the other hand, if the split-shift condition 
holds with split-poly p, then p has no zeros on T x D and the form 
corresponding to l/\p\ 2 da agrees with the form T. Then, by Cauchy- 
Schwarz 

1 = ( I ^w d V </«*,/■ w io= I i ia (% da= , 

\Jt* |P| \Tt ) J T 2 \p\ 2 J T 2 t J T 2 t Jj2 \p\ 2 

since the forms agree. Since we have equality in our application of 
Cauchy-Schwarz, it is not hard to see t = \p\ 2 . □ 

6. The matrix condition 

The abstract flavor of the split-shift orthogonality condition makes 
it difficult to check. This section is devoted to showing it is equivalent 
to checking that a number of natural operators vanish. 

The "matrix condition" ( 12.61) from Theorem 12.21 can be viewed as 
saying the smallest invariant subspace of T containing the range of B 
is contained in the kernel of A. 

Proposition 6.1. // a positive linear form T satisfies the split-shift 
condition with shift-split (/Ci, /C2), then the matrix condition (12. 6p holds 
and 

(6.1) V{(T*yA*f : / G w£l m _ 1: j = 0, 1, . . . } C K x 

(6.2) V{T*Bf : / G wJ^^j = 0, 1, . . . } C /C 2 . 

Proof. Let (/Ci,/C 2 ) be a shift-split of £-\ m . Then, S^-im = K<i ® k<2 
and E\ m = ¥L\ © zJC 2 © Cp where p is the associated split-poly. 

The strategy is to prove (1) the range of B is contained in /C2, (2) 
/C2 is an invariant subspace of T (T/C2 C /C2), and (3) AK,2 = 0. This 
will imply that (I2.6P holds as well as (16. 2p . 

Since /Ci C £^ m , /Ci J- wJ r2 m _ l and therefore for g G JC\, f G 

So, the range of B is orthogonal to /Q and therefore must be contained 
in /C 2 . 

To show T/C2 C /C2, let f'2 G /C2 and g± G JC\. Since ZJC2 _L /Ci, we 
know gi _L zf 2 and therefore 

(Tf 2 , gi ) = (M z f 2 , gi ) = 0. 

So, T/C 2 -L JCi and thus T/C 2 C /C 2 . 
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Finally, since zK 2 C S^ m _L w£\ m _ x , we must have 
AK 2 = P wt a M z JC 2 = 0. 

^n,m— 1 

The proof of (16. ip is similar if we work with adjoints of our operators. 

□ 

Proposition 6.2. Suppose T is a positive linear form on C n ^ m satis- 
fying the matrix condition (12.61) . Set 

/C 2 = V{T*Bf : / G wTl m _ v j = 0, 1, . . . } 

and /Ci = Sn-im © ^-2- Then, (/Ci,/C 2 ) a shift-split of S^-im an d 
hence T satisfies the split-shift orthogonality condition. 

Proof. Notice that by the Cayley-Hamilton theorem we do not need to 
consider all powers of T in the definition of IC 2 , so that 

K 2 = V{T*Bf : j = 0, 1, . . . , n - 1, / G wJ*^} 

and also AT j B = for j = 0, 1, 2, . . . . 

We need to show zfC 2 _L /Ci and 2/C2, /Ci C £^ m . 

To prove zfC 2 _L /Ci, simply note that for / G wJ-^ m _ 1; g G /Ci, 
j = 0, 1, 2, . . . , we have 

(zT^Bf,g) = (T^ 1 Bf,g) = 0. 

The proves -L JC\ since is spanned by elements of the form 
zT'Bf. 

To show zK 2 C E\ m , note that 

zJC 2 C P n , m e zty? n _i, m -i = E x n m © wS\ m _ x 

and therefore it is enough to show zK 2 _L m „ 1 . So, for / G u>.F^ m _ x 
and g G m-1 we have 

(zT j Bf, g) = (AT j Bf, g) = j = 0, 1,2... 

since A = P wS 2 M z and AT^B = 0. This proves z/C 2 C £^ >m - 

Similarly, to show /Ci C £* m it is enough to show /Ci _L wJ-^ m _ 1 , 
since /Ci C "P nifn wV n -i, m -i- Observe that for / G wJ^ m _j and 
g G /Ci, 5/ G /C 2 and so 

0={Bf 7 g) = {f,g) 
and therefore if^ m _i -L /Ci. □ 
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7. Description of shift-splits and split-polys 

If the split-shift condition holds for T, then we have seen that T can 
be represented using moments of a measure l/\p\ 2 da where p G C[z, w] 
has no zeros inTxi and p has degree at most (n, m). The description 
of all such p is essentially an algebra problem. 

Lemma 7.1. Let t(z,w) be a two variable trigonometric polynomial 
which is factorable as \p(z,w)\ 2 where p has degree at most (n,m) and 
no zeros in T x B. 

Then, there exists a g G C[z, w] with no zeros in T x D none of whose 
irreducible factors involve z alone, and there exists a stable polynomial 
q G C[z] (no zeros on ID)) such that 

t(z, w) = \q(z)g(z, w)\ 2 for (z, w) G T 2 

Moreover, if t(z,w) = \pi(z,w)\ 2 where pi has degree at most (n,m) 
and no zeros on T x D, then there exist qi, q^ G C[z] such that q = qiq 2 
and 

Pi(z,w) = q 1 (z)q 2 (z)g(z,w) 

Proof. Suppose t = \p\ 2 as above. We may factor p(z,w) = h(z)g(z,w) 
where g has no irreducible factors involving z alone. By the one variable 
Fejer-Riesz lemma we can factor \h\ 2 = \q\ 2 with q, a stable one variable 
polynomial. Then, t(z,w) = \q(z)g(z,w)\ 2 on T 2 . 

Now, if t = \pi\ 2 as above, then again pi(z, w) = hi(z)gi(z,w) where 
gi has no irreducible factors involving z alone. Now, 

h 1 (z)g 1 (z,w)h 1 (z)g 1 (z,w) = h(z)g(z,w)h(z)g(z,w) 
on T 2 which implies 

h 1 {z)g 1 {z,w)hi{z)gi{z,w) = q(z)g(z,w)q(z)g(z,w) 
on all of C 2 , when we reflect at appropriate degrees. Then, for z G T 
hi(z)g!(z,w) _ q(z)g(z,w) 
q(z)g(z,w) HiO)<?iO,w) 

and the left side is holomorphic for all w G © and the right side is 
holomorphic for \w\ > 1 making the function entire and rational in w. 
The same can be said for the reciprocal and this forces the function to 
be constant in w. So, for z G T 

hi(z)g!(z,w) = hi(z)gi(z,0) 
q(z)g(z,w) q(z)g(z,0) 

and we see 

gi(z, w)g(z, 0) = g(z, w)g 1 (z, 0). 
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This extends to all z £ C and since g and gi have no irreducible factors 
involving z alone, we may conclude they are constant multiples of one 
another. The constant can be absorbed into the definition of hi so 
that pi(z, w) = hi(z)g(z,w). Then, \pi\ 2 = \p\ 2 on T 2 implies that 
l^-i 1 2 — \q\ 2 on T. It is then elementary to show hi is obtained by 
flipping some of the roots of q to inside D. □ 

Lemma 7.2. If the split-shift orthogonality condition holds with a given 
split-poly p, then the spaces /Ci and JC 2 are uniquely determined by p. 

Proof. Looking at the formulas in Theorem 15.51 we see that since all 
of the E or F kernels are uniquely determined, the kernels K 2 — Ki 
and wfjK 2 — Ki are uniquely determined. We see that (1 — wfj)Ki 
is uniquely determined and so Ki is uniquely determined. A similar 
argument shows K 2 is uniquely determined. □ 

We can now give a description of all possible shift-splits. 

Proposition 7.3. If the split-shift condition holds for a positive linear 
form T on C n , m , then there exists g £ C[z, w] with no zeros on T x D 
and no irreducible factors involving z alone and stable q £ C[z], such 
that q(z)g(z,w) is a split-poly. Write ni := deg z g and n := n — n\. 
Every other split-poly is of the form 

1i{z)q 2 {z)g{z,w) 

where deg (71(72 < n , q(z) = qi(z)q~2(z), and q 2 is reflected at the degree 
of (72- The associated shift-split (/Ci,^) is given by 

JCi = VP £ i_ im {z j qi{z)gi(z) : < j < degq 2 + degg 2 } 
fC 2 = \/P £ i_ im {z 3 q 2 (z)g 2 (z) :0<j<n-degq 2 - degg 2 } 

where g(z,0) = gi(z)g 2 (z) with gi having no zeros in D and g 2 having 
all zeros in D. 

Proposition 16.21 singles out the shift-split with minimal K, 2 which 
would correspond to the split-poly q(z)g(z,w), where q(z) is reflected 
at degree n . The shift-split with minimal /Ci corresponds to split-poly 
q(z)g(z, w). This leads to a canonical decomposition of £^_ lm which 
does not depend on a choice of shift-split. Let degg = (ni,m) and 
n := n — ri\. 

Define 

/C = V{z j g(z,w) : < j < n } 

A = VPeU.J^vWfrW '■ ^ 3 < de S^} 
B = V p ei_ l7m {z j q(z)92(z) : < j < m - degg 2 }. 
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Theorem 7.4. Let T be a positive linear form on £ n , m satisfying the 
split-shift condition. Then, 

£n-l,m = ^0 © A © B. 

Both (/Co © A, B) and (A, /Co © B) are shift-splits. If (/Ci, /C2) is any 
shift-split, then A C /Ci and B C /C2. 

Proof. It follows by inspection of definitions that A. C /Ci and B G IC2 
using /Ci and /C2 from Proposition 17.31 

Let g G C[z, u>] and q G C[z] be as in the previous proposition. In 
L 2 (l/\qg\ 2 da), g _L z^w k+1 for j G Z and /c > because 

JqWJr^M (2tt) 2 

since l/g(z,-) is holomorphic. Therefore, z j g(z,w) G £n-im f° r — 
j < n and we see 

^0 = VP g i_ lira {2^i(z)0 2 (*) : < j < n Q } 

is contained in 

VP £ i_ im {z j gi (z) : < j < n + deg# 2 } 

but this corresponds to /Ci in the shift-split coming from the split- 
poly q(z)g(z,w). Hence, this space and /Co must be orthogonal to the 
associated /C2 which happens to be B. Notice also that 

A C VP £ i_ im {z j gi (z) : < j < n + degg 2 }. 

A similar argument shows /Co is orthogonal to A, and by dimension 
considerations 

/C © A = yP £ i_ itm {z j gi {z) : < j < no + deg# 2 } 

and again by dimension considerations 

/C © A © B = £n-i, m - 

We already noted that /Co © A corresponds to "/Ci" in some shift- 
split. Therefore, (/C © A, B) is a shift-split. By a similar argument, 
(.A, /C © 5) is a shift-split. 

□ 

Propositions 16.11 and 16.21 together show that the invariant subspace of 
T generated by the range of B is the minimal possible "/C2" occurring 
in a shift-split. We have already computed the minimal /C2, which is B. 
A similar argument can be used for the minimal /Q which is A. This 
implies the following. 
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Theorem 7.5. If the split-shift condition holds, 

B = V{T>Bf : / G wTl^j = 0, 1, . . . , n - 1} 

and 

A = V{(T*YA*f : / G wEl^j = 0, 1, . . . , n - 1} 

^ere A* = P £ i_ lm M 1/jB : l ^ flnrf T * = P^^My, : 

^rijm— 1 n— l,m' 

We can now prove the stratified characterization of Bernstein-Szego 
measures. 

Proof of Corollary \2.6[ Suppose T is a positive linear form on £„ im 
given by 

T(z>w k ) = I ziw k ff |dtp| |j| < n, |*| < m, 

i T 2 (2tt) 2 |p(2;,m;)| 2 

where p G C[z, has no zeros in T x D, degree at most (n,m) and 
0) has d zeros in D. We write p(z, 0) = 0(2)6(2) where b has all 
zeros in D and a has no zeros in D. By Theorem 14.8} T possesses a 
shift-split (Kx, /C 2 ) where /Ci has dimension d = degb(z). 

Next, supposing T possesses a split-shift (/Ci,/C 2 ) where /Ci has di- 
mension d, by Theorems 17.41 and 17.51 we have A C /Ci C ^4 © JC . 
Therefore, 

(7.1) dirndl < d < n - dim B. 

Finally, if T satisfies the matrix condition and (17. ip . then we see 
from Proposition 17.31 that it is possible to choose /Ci with dimension d 
and the corresponding split-poly p has the desired property that p(z, 0) 
has d roots in D. □ 



8. Construction of p from Fourier coefficients 

In this section, it is useful to write z = (zx, Z2) for an element of C 2 as 
opposed to (z, w), so that we can use multi- index notation z u = z^z^ 2 ■ 

Supposing the split-shift condition does hold, how do we construct 
p directly from the Fourier coefficients 

T(z- U ) = c u ? 

In principle, one could construct (/Ci,/C2) and then produce p as an 
element of S n ^ m (/Q © 2/C2); however, this is quite involved. In this 
section we describe a simpler procedure assuming we already know that 
the shift-split condition holds. 
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First, we construct an orthonormal basis for £% m . It helps to use 
interval notation for subsets of integers as in [0, n] = {0, . . . ,n}. Let 
Si = [0,n] x [0,m]\{(0,0),...,(0,j-l)}, S = [0,n] x [0,m]. Let 

and define 



),(o,i)" 



Then, fa, fa, ... , (fi m form an orthonormal basis for £ 2 m . To see this 
let u G Sj + i and 7 = v/t^j) (0 j)- We compute 

z u ) = ^ 7 ( oj> c «-«/7 = 5(o,i),«/7 = 

since (0, j) ^ Sj + ± and S^+i C Sj. For k > j, fa is a combination of z" 
with u £ Sk C Sj + \ and therefore 0fc _L 0-,- for k > j. Also, 

v,u£Sj u£Sj 



The reproducing kernel for is therefore E^z; () = J2jLo ( t ) j{ z ) c t ) j{C)- 
We assume p{zi,z 2 ) = q(zi)g(zi, z 2 ) with q stable and g has no 

factors with z\ alone and then show how to construct g and q using 

only the moments c u . Theorem 15.51 proves 

z?E 2 m (z; l/z u 0) = P {z)zZP(1/z u 0) = q(z 1 )g(z)z^q(l/z 1 )g(l/z 1 , 0). 

From this we can calculate g up to a constant multiple. The key point 
is that the product of all factors of the above polynomial that involve z\ 
alone will be the greatest common divisor of the coefficients of powers 
of z 2 . 

Let us write 

j=0 

and then compute Q = gcdji^o, E±, . . . , E m } using the Euclidean algo- 
rithm. Then, Q(zi) = Cq(zi)z™q(l/zi)g(l/zi,0) for some constant C. 
This gives 

z?E 2 m (z;l/z 1 ,0)/Q(z 1 )=g(z) 

possibly with a constant. At this stage we look at the one variable 
moment problem 

\dzi\ 



Cj = T(z 1 3 g(z)g(l/z 1 ,l/z 2 )) = / z 



-.1 
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for \j\ < no := n — deg z g. Set 



7i,fe 




and then we can construct 

no 



3=0 

(up to a unimodular multiple) by one variable theory. 
Hence, we have constructed p as p(z) = q(zi)g(z). 



9. Applications 



9.1. Autoregressive filters. A direct application of the above work 
is to two variable autoregressive models [22] . 

We consider (wide sense) stationary processes X = (X u ) u&1 2 de- 
pending on two discrete variables defined on a fixed probability space 
(Q,A,P). We shall assume that X is a zero mean process, i.e. the 
means E(X U ) are equal to zero. Recall that the space L 2 (Q,A, P) of 
square integrable random variables endowed with the inner product 



is a Hilbert space. A sequence X = (X m ) m€ z 2 is called a stationary 
process on Z 2 if for m, n £ Z 2 we have that 



It is known that the function Rx, termed the covariance function of 
X, defines a positive semi-definite function on Z 2 , i.e. 



for all k £ N, «i, . . . , «fc £ C, r\, . . . , £ Z 2 and Bochner's Theorem 
states that for such a function Rx there is a positive regular bounded 
measure fix defined for Borel sets on the torus [0, 2n] 2 such that 



for all two tuples of integers u. The measure fix is referred to as the 
spectral distribution measure of the process X. The spectral density 
fx(t) of the process X is the spectral density of the absolutely contin- 
uous part of fix, i-e. the absolutely continuous part of fix equals 



(X,Y) := E(XY*) 



E(X m X* n ) = E(X m+u X* n+u ) =: R x (m - n), for all u £ Z 2 . 



k 





fx(ti,t 2 ) 



dt\dt 2 



(27T) 2 • 
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Let H = {(k,l) : -oo < k < oo,l > 0} U {(Ac, 0), A; > 0} and let 
A„, m = {(kj) : < k < n,0 < I < m} C H U {(0, 0)} be a finite set. 
A zero-mean stationary stochastic process X = (X u ) u< zz 2 is said to be 
extended autoregressive or eAR(n,m), if there exist complex numbers 
a,k, k G A„ jm with a(j j0 ) 7^ for some < i < n, so that for every u 

(9.1) ^ a kX u -k = £ u , u G Z 2 , 

where {£ u : u G Z 2 } is a white noise zero mean process with variance 1. 
The eAR(n, m) process is said to be acausal (in z) if there is a solution 
to equations (19.11) of the form 

X u = ^2 (f)k£ u -k,u G Z 2 , 

fceHu{(o,o),(-i,o),...} 

with < oo and it is said to be causal if there is a 

k£HU{{0,0),(-l,0)...} 

solution of the form 

X u — cj) k £ u _ k , u G Z 2 , 

fcemj{(o,o)} 

with |0fe| < oo. From the general theory of autoregressive 

fcemj{(o,o)} 

models it follows that if (19. ip has a causal (acausal (in z)) solution 
then 

(9.2) p(z,w) = avZ Vl w V2 . 

is stable on D 2 (TxD). 

The bivariate extended autoregressive (eAR) model problem con- 
cerns the following. Given autocorrelation elements 

Cfc = E(X Xl), k G A n>m — A n>m 

determine, if possible, the coefficients ai,l G A n m of an acausal au- 
toregressive filter representation. In [10] necessary and sufficient con- 
ditions were given for the autocorrelation coefficients in order for the 
eAR(n, m) to have a causal solution. Here we give necessary and suf- 
ficient conditions in order for an eAR(n, m) model to have an acausal 
solution. 

If we begin with a polynomial that is nonzero for (z,w) 6 TxD 
then choosing the autoregressive filter coefficients as in equation (19. 2 p 
give an eAR(n, m) model whose Fourier coefficients give a linear form 
that is positive and satisfies conditions in Theorem 12.21 Conversely, we 
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can use the conditions in Theorem 12.21 to characterize the existence of 
£lC£lU.Scll (in z) solution. 

Theorem 9.1. Given autocorrelation elements Ck,i, {k, I) G A rejm — A n>m 
there exists an acausal (in z) solution to the eAR(n, m) problem if and 
only if the linear form T determined by the Fourier coefficients Ck is 
positive and satisfies one of the equivalent conditions of Theorem \2.2\ 

Corollary 9.2. With the hypotheses of the above Theorem there exists 
a casual solution to the eAR(n, m) problem if and if the linear form T 
determined by the Fourier coefficients Ck is positive and A = 0. 

9.2. Pull measure characterization. We now identify which mea- 
sures dfi are of the form 

\p(z,w)\ 2 

where p 6 C[z, w] has no zeros on T x D. 

Corollary 13 . 6 1 provides necessary conditions which can be encoded as 

(9-3) £n,M = £n+j,M 

for all j > and M > m — 1. By performing the reflection operation, 
it follows that 

z j-p2 _ x-2 

* J n,M J n+j,M 

for all j > 0,M > m- 1. 

It turns out that conditions (19. 3 j) for M = m — l,m are sufficient 
to show that the moments J z^w k d[i agree with the moments of a 
Bernstein-Szego measure when j G Z and \k\ < m (i.e. on a strip). It is 
then another issue to prove that the Bernstein-Szego measure obtained 
with a particular m agrees with other choices. 

Fix m and define A N = P w£% M z P £ i ,T N = P £ i M z P £ i , 

N,m — 1 iV — 1,771 iV — 1,777, N— 1,777. 

and B N = P £ i P wF 2 . 

N — l,m N,m, — 1 

Lemma 9.3. Assume (19. 3p holds for j > and for M — m — l,m. 
Then, A N T^B N = for N > n, k > 0. 

Proof. Let P = P\ + P2 be the projection onto the space 

k 

PN+k,m Q WV N -I,m-1 = £jv-l,m (J) ^N+j,™ 

j=0 

where Pi, P 2 are the projections onto £ jv_i ;m > ©J=o -^N+j,m respectively. 

Noting that z 0j =o J^ +jtm = 0* =o J^ +i+1>m by (Q, we have 
PiM 2 P 2 = 0. Then, T w = PjM^ = PiM 2 (P + P 2 ) = P X M Z P. 
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Therefore, for k > 1 we have 

Tx = P x M z {PM z P) k ~ x P. 
Similarly, A N = P w£ i M Z P X = P w£ 2 M Z P, so that 

N,m—1 N,m—l 

A N T k N = P w£%m _M z (PM z P) k P 



Next, PP.., T 2 = P,„f 2 while 

W - r N,m-l W ~ r N,m-l 



PM z P wT 2 = P wT 2 M z P wT 2 

since zwJ r f f+ j m _ 1 = wJ r % + j +1 ^ m _ 1 by (19. 3ft so that inductively we have 



fc-i 



A N T k B N = P^MzP^ ^ J] (M 2 P ? 



N-\-j,m— 1 



where the product is multiplied from right to left as j goes from to 
k — 1 (if fc = 0, the product is 7). But, P w£ 2 M z P wT 2 = as 

N,m—1 N"4-fc,m— 1 

Therefore, assuming (19.31) for M = m,m — 1 and j > 0, the matrix 
condition holds for the positive linear form on Cn,™ for N > n. So for 
each N, there is a p^ G C[z, u>] of degree at most (iV, m) with no zeros 
on T x D such that the Bernstein-Szego measure for p^ matches the 
moments of d\x on £>v,m- We can further assume that each p^ has been 
normalized so that px(z,w) = qN{z)gN(z,w) where q^ is stable in z 
and (?jv has no factors involving z alone. By Theorem 15.51 and (19.31) . if 
we set z = ( G T, w G C, r\ = 0, we get 



Pn(z, W)p n (z, 0) = Pn+j(z, Vj)p n+j (z, 0) 

for j > 0. This implies g n = g n+ j for each j > (after absorbing 
constants into g's if necessary) and then by stability of each q^, q n = 
q n+ j for each j > 0. Therefore, the moments of d\x on the strip \z?w k : 
j G Z, | A; | < m} are matched by those of l/|p n | 2 <io". 

Theorem 9.4. Let d/u, be a positive Borel measure. If 
£ 2 = £ 2 £ 2 = £ 2 

n,m n+j,rn n,m—l n+j,m—l 

for j > 0, then there exists p G C[z, w] of degree at most (n, m) with 
no zeros on T x D such that 

z j w k 



z ] w k dn= / — —da 

\p{z,w)\ 2 



for j G Z and \k\ < m. 
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To get the full measure characterization, we note that for a Bernstein- 
Szego measure (and recalling G v from Section [3]) 

G (z, w) = p(z, w)z n p(l/z, 0) 

is an element of the one dimensional space % m defined in f)2.10p . but 
by all of the orthogonality relations for Bernstein- Szego measures it is 
also m.'H.M for M >m. Therefore, a set of necessary conditions is 

T-i m = 7-Lm+j for j > 0. 

Theorem 9.5. Let dfi be a positive Borel measure on T 2 satisfying 

for M > m — 1 and j > 0. Then, there exists p 6 C[z, w] of degree at 
most (n, m) with no zeros on T x D such that 

d ^° 

\p(z,w)\ 2 

Proof. The conditions on S 2 . imply that for each M > m, there exists 
Pm G C[z, u;] of degree at most (n, M) with no zeros on T x D such 
that the moments of da /\pm\ 2 match those of dfi on the strip {z^w k : 
j G Z, \k\ < M}. We normalize Pm(z,w) = qM(z)gM(z,w) where qu 
is stable and gM has no factors involving z alone. 

Using the assumption % m = 1-L M for m > M, it follows that for each 
m> M 

p m (z,w)z n p m (l/z,0) = Cp M (z,w)z n p M (l/z,0) 

for some constant C. We then must have that g m and gM are constant 
multiples and then since q m , qu are stable, they too must be constant 
multiples of one another. Therefore, p m and pu must be constant 
multiples. The constant must be unimodular since p m and pu have 
unit norm. Therefore, the measures l/\p m \ 2 da = l/\pM\ 2 do~ match all 
of the moments of dfi. Hence, l/\p m \ 2 da = d\i. □ 

9.3. Concrete expression for the full measure characterization. 

The conditions 

for M > m — 1 and j > given above can be written directly in terms 
of the Fourier coefficients of /x 

c u = J z~ u dfi u = (ui,u 2 ) G Z 2 

as follows. Similar to Section [8] it is useful to write z = (zi, z 2 ) for an 
element of C 2 as opposed to (z, w) and we will use multi-index notation 
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z u = Zi x z% 2 . Also, [0, AT] = {0, 1, ... , iV}; there should be no confusing 
this with a closed interval of real numbers. 
Let 

r N,M\ — ( n \-l 

\lu,v )u,ve[0,N]x[0,M] ~ \ c v-u) u>ve l 0j N] x [ 0j M]- 

A basis for 8^ M consists of 

rN,M / \ N,M v 

U (*) = 1^ %*),v z 

ve[0,N]x[0,M] 

for j = 0, 1, ... , M. In order for £j; +1 M = S^m t° no ^ we nee d the 
coefficients of for k = 0, 1, . . . , M to vanish in M . Looking 

at /■ +1 ' this amounts to 



JV+l,Af _ n 

7(oj),(jv+i,fc) - u 



for j, fc = 0,1,..., M. 



Therefore, the conditions M = £%+j m f° r 3 — an d — 



m 



can be expressed as 



N+1,M _ n 

^(0j),(iV+l,fe) ~ U 



for N > n, M > m - 1, j, k = 0, 1, . . . , M. 
Next we turn to the conditions 

W m = Um+j for j > 0. 

Recall 

U M = V 2n . M e V{^'4 : < j < 2n, < k < M, (j, k) ± (n, 0)} 
so that a nonzero element of the one dimensional space Hm is given by 

9m(z) = ^,o),u zU 

ue[0,2n]x[0,M] 

where we define 

pM _ ( \-l 

Su,« v^-«/u,?;e[0,2n]x[0,M] - 

The condition H M = %m-\ can then be expressed as 

£( n ,o),(j,M) = j = 0, 1, . . . , 2n. 
Let us summarize everything. 

Theorem 9.6. Let n be a positive, finite measure on T 2 with moments 
c u for ugZ 2 , There exists a polynomial p £ C[z, u>] o/ degree at most 
(n, m) rai/i no zeros on T x D swc/i i/iai 

\p{z,w)\ 2 

if and only if 
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(3) for M> m, j = 0, l,...,2n 




where 



(lu,v )u,ve[0,N]x[0,M] — (Cv-u) 



-i 



u,v<=[0,N]x[0,M] 



10. Generalized distinguished varieties 

10.1. Construction of the sums of squares formula. Here we use 
Kummert's approach as in [19] to give a different proof of the sums of 
squares formula Theorem 12.41 One advantage of this approach is that 
it works for p with no zeros oeTxD and no factors in common with 
p (rather than assuming no zeros on T x B). This approach is also 
useful because it shows how to compute the reproducing kernels in the 
decomposition of p using only one variable theory. 

Theorem 10.1. Suppose p £ C[z, w] has degree (n,m), no zeros on 
TxB and no factors in common with p. Let ri2 be the number of zeros 
of p(z,0) in D and ri\ = n — n^- Then, there exist vector polynomials 



E E C m [z, w],Ae C ni [z,wlB E C n2 [z,w] such that 

|^(^,^)| 2 -|K^^)| 2 = (l-j^| 2 )|^(^,^)l 2 +(l-kj 2 )(|^4(^,^)i 2 -|^B(^,^)| 2 ). 



• E has degree at most (n,m — 1) and A and B have degree at 
most (n — l,m), and 

• the entries of A and B form a linearly independent set of poly- 
nomials. 

The last two details are needed in Section 110.21 
Consider for z E T 



where T(z) is an m x m matrix valued trigonometric polynomial which 
is positive definite for all but finitely many values of z E T. This is 
because T(z) is positive definite for each value of z such that p(z, ■) 



p(z, w)p(z, rj) - p(z, w)p(z, rj) 
1 — wfj 



(l,fj,...,fr- 1 )T(z)(l,w,...,w 



m—l\t 
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has no zeros in T. There can only be finitely many zeros on T 2 or else 
p and p would have a common factor. Let S = {z G T : detT(z) = 0}. 
By the matrix Fejer-Riesz theorem in one variable, we may factor 

T(z) = E(z)*E(z) 

where E is an invertible matrix polynomial on D of degree at most n. 
Let 

E(z,w) = E(z)(l,w,...,w m ~ 1 f 

so that 



p(z,w)p(z,rj) — p(z,w)p(z,rj) = (1 — wrj)E(z,r])*E(z,w) 

for z G T. We are using both the notations E(z) and E(z,w), but no 
confusion should arise. 

Then, for fixed z G T \ S, the map which maps 

p(z,w) \ ^ ( P~( z , w \ 
wE(z,w)J yE(z,w] 

extends to a unitary U (z) which we can explicitly solve for. Write 

m m 

P(Z,W) = ^2pj(z)w J , P(Z,W) = ^2p m ^ j (z)w : > . 

3=0 3=0 

Then, 



U(z) 



Pm{z) ■■■ Pl(z) Po{z)\ fpo{z) Pl(z) ■■■ p r , 

E{z) J \ E(z) 



which is unitary by construction for z G T \ S, but clearly extends 
to a matrix rational function with poles in D at the zeros in D of 
Po(z). Moreover, any singularities on T must be removable because U 
is bounded on a punctured neighborhood in T of each singularity. 

Set ri2 to be the number of zeros of p(z, 0) in D and n\ = n — ri2- 
Theorem 110.41 and Section [10. 1.21 below prove that 

(id) I " u(0*u(z) = F{c y F{z) _ G ( C y G ( z ) 

1 - (z 

where F is n\ x (m + 1) and G is n<i x (m + 1), and the rows of F 
and G are linearly independent as vector functions; meaning there is 
no non-zero solution (fi,f2) G C n to 

(10.2) Vl F(z) +v 2 G(z) =0. 

Accepting all of this for now, we rearrange ( 110. lfl to get 

I+(F(ty Z F(z) + G((yG(z) = U(0*U(z) + F(0*F(z) + (G(0*zG(z) 
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and so there exists an (m + 1 + n) x (m + 1 + n) unitary (with two 
indicated block decompositions) 



C m+i c 

v = c m+1 fv{ vi 



c 



c 1 c m+n 
f v x v 2 
v 3 v 4 



m+n 



(10.3) 

such that 
(10.4) 



Multiplying both sides of this equation by X(z,w 
gives 

/ p(z, w) \ ( p(z,w) \ 





p(z,w) 
wE(z, w] 



(10.5) 



V 



wE(z, w) 
zF{z)X{z,w 
\G(z)X(z,w) J 



E(z,w) 
F(z)X(z,w) 
\zG(z)X(z,w) J 



Let A(z,w) = F(z)X(z,w) and B(z,w) = G(z)X(z,w). The entries 
of A and B are linearly independent, because if v\ G C" 1 , v% G C™ 2 and 

= v x A{z, w) + v 2 B(z, w) = (uiF(z) + v 2 G(z))X(z, w) 

then since 



X(z,w) 



Po(z) Pi(z) ■■■ Pm(z) 
E(z) 



we have 







/ 1 \ 



The matrix on the right is invertible in D except at possible zeros of 
Po, so we get ViF(z) + v 2 G(z) = which implies V\ — and f 2 = 0. 

Taking the norm squared of both sides of (110. 5p gives the following 
formula since V is a unitary 

\p(z,w)\ 2 + \w\ 2 \E(z,w)\ 2 + \z\ 2 \A(z,w)\ 2 + \B(z,w)\ 2 

= \p(z,w)\ 2 + \E{z,w)\ 2 + \A(z,w)\ 2 + \z\ 2 \B(z,w)\ 2 . 

If we rearrange we get the desired sum of squares formula 

iK^^r-fe^)i 2 =(i-hr)iF(z,^i 2 +(i-ki 2 )(iA(^ W )i 2 -is^ W )i 2 ). 



POLYNOMIALS WITH NO ZEROS ON A FACE OF THE BIDISK 



41 



One final technicality is that while E has entries that are polynomials, 
it is not clear that the same holds for A and B. To show they are 
polynomials we go through a longer process of proving the following 
"transfer function" representation which is interesting in its own right. 
Set 

Wlm 



(10.6) 



A(z,w) 



r(z,w) 








zL s 
















' in 







z I, 



112 . 



Theorem 10.2. Suppose p G C[z, w] has no zeros m T x D, degree 
(n,m), and no factors in common with p. Then, there exists a (1 + 
m + n) x (1+m + n) unitary matrix V such that 

p(z, w) 



(10.7) 



p(z,w) 



V x + V 2 A(z, w)(T(z, w) - \Z 4 A(z, w))~ l V 3 . 



Here we use the block form indicated in (1 1 . 3 [) and again n 2 is the 
number of zeros of p(z, 0) in D and ri\ = n — n 2 . 

A technicality we must address is whether the matrix we invert above 
is non-degenerate. The fact that the rows of F and G are linearly 
independent is used to show this. 

By (I10.5p . the map sending 

/ p(z,w) \ 
wE(z, w 
zA(z, w] 
\ B(z,w) ) 

extends to the unitary V. Then 



/ p(z,w) \ 
E(z,w) 
A(z,w) 
\zB(z,w)J 



V 1 p + V 2 A(z,w 



V 3 p + ViA(z,w 




P 



T(z, w 




which implies 



pV 3 = (T(z,w)-V i A(z,w)) 
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We would like to invert the matrix on the right, so we need to make 
sure 

(10.8) det (TO, w) - V 4 A(z, w)) 

is not identically zero. This is equivalent to 

/ fwl m \ 
det zl m - V 4 
\\ z-H^j 

being non-trivial by simple matrix manipulations. The coefficient of 
w m will occur as 

where VI is the lower right nxn block of V as in (110. 3p . We shall show 
this determinant is non-vanishing for z6l If it does vanish for some 
z = ( G T, then there exists a nonzero v = (vi,v 2 ) G C™ = C ni+ " 2 such 
that 

K»>)( C o"' cl) = K " 2W ' 

This implies that \\v\\ = \\vVl\\ and since V is a unitary vV^ = 0. Then, 

by (DSHD 

(0,v 1 ,v 2 )V zF(z) = (0,C«i,C«2) ^(«) = (0,^1,^2) 
so that 

vttzFiz) + v 2 (G(z) = Vl F(z) + v 2 zG(z) 

and then 

(zC-l)v 1 F(z) + (C-z)v 2 G(z) = 0. 

This implies 

vxF(z) - Cv2G{z) = 

contradicting (110. 2p . Therefore, the determinant in fl 1 . 8 [) is not iden- 
tically zero, and 

(E{z,w) 

(10.9) p(z, w) (F(z, w) - V 4 A(z, w))' 1 V 3 = A(z, w) 

\B(z,w) j 
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which in turn yields (110. 7(1 . Examining f l 1 . 7(1 we see that since p has 
degree (n, m), since 



has degree at most (n,m), and since p and p have no common factors, 
we must have that p is a constant multiple of the above determinant 
else (110.7(1 could be reduced further. This implies that the left hand 
side of (110. 9p is a vector polynomial and finally we see that the entries 
of E, A, B are polynomials. By Cramer's rule the entries of E have 
degree at most (n, m — 1) and the entries of A and B have degree at 
most (n — 1, m). 

10.1.1. Unitary valued rational functions on the circle. The following 
is undoubtedly well-known material from systems theory; however, we 
were unable to find a suitable reference so we include a detailed expla- 
nation. 

Theorem 10.3 (Smith Normal form [2]). Let R be a principal ideal 
domain and let A be an N x N matrix with entries in R. There exists 
a unique (up to units) diagonal matrix D G R NxN , called the Smith 
Normal form, with entries D\\D2\ ■ ■ ■ |-Djv such that 



where S,T 6 j^NxN an( ^ g-i ^y-i g R NxN . The matrix S is formed 
through the row operations of (1) multiplying a row by an element of 
R and adding the result onto another row, (2) switching two rows, and 
(3) multiplying a row by a unit in R. 
The entries Dj may also be computed as 



where gcd (A) represents the greatest common divisor of determinants 
of all j x j submatrices of A (gcd := 1 ). 

Let U £ C(z) NxN be a. rational NxN matrix function of one variable 
which is unitary valued on the unit circle. Let R be the ring of fractions 
C[z]iS _1 where S is the multiplicative set S = {q G C[z] : q(z) ^ 
for z G D}. We may write U = - q Q where g e C[z] has all zeros in D 
and Q G R NxN . Let D be the Smith Normal form of Q in R. Write 



det 




A = SDT 



gcd,(A) 




1 Qj 
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in lowest terms and define 

N 

Ni = # zeros of dj in © counting multiplicity 

N 

N 2 = # zeros of in D counting multiplicity. 

i=i 

Theorem 10.4. With U as above, there exist an N\ x N matrix func- 
tion F and an N 2 x N matrix function G such that 

I - U(()*U(z) = F{c y F{z) _ G{c y G{z y 
1 - < 

The rows of F and G together form a linearly independent set of vector 
functions on D. 

Proof. We shall use if 2 to denote the vector valued Hardy space if 2 (T)cg> 
for short. Now UH 2 is a reproducing kernel Hilbert space with 
point evaluations in D except at the poles of U in D. In L 2 = L 2 (T) <g> 
C^, if / G H 2 and v G C N 

which shows U H 2 has reproducing kernel 

ujzMcy 

1-zC 

Notice UH 2 is not necessarily contained in H 2 . The space UH 2 V H 2 
is therefore a reproducing kernel Hilbert space containing both spaces. 
Consider the kernel 

1 - K 

which is not necessarily positive definite but is rather the difference of 
two reproducing kernels 

We shall in general use K-u to denote the reproducing kernel of a space 
U in UH 2 V H 2 . We can decompose H 2 = {H 2 n UH 2 ) © (H 2 Q (if 2 n 
UH 2 )) so that 



and similarly 



Kg 2 — Kg 2c]U ij2 + KH2 e (H 2 nuil 2 ) 



Kuil 2 ~ ^H 2 r\UH 2 + ^UH 2 e{H 2 nUH 2 )- 
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Therefore, 



K ~ ^H 2 e(H 2 DUH 2 ) ^UH 2 e(H 2 nUH 2 ) 



The spaces H 2 Q (H 2 n UH 2 ), UH 2 Q (H 2 n UH 2 ) are actually finite 
dimensional. We can compute their dimensions as follows. 

Write U = \Q where q has all zeros in D and Q has entries in 
R. Note that since U is unitary on the circle, U has no poles on the 
circle (U is bounded near any potential singularities). Therefore, the 
entries of Q belong to the smaller ring Rq = C^Sq 1 where «So is the 
multiplicative set «S = {q G C[z] : q{z) 7^ for z G D}. By Theorem 
110.31 we ma Y write Q = SDT where S, T are matrices with entries in 
R whose inverses have the same property, and D is the Smith Normal 
form of Q in Rq . The elements Dj G Rq have no zeros on the unit circle 
since detU = det Q/q N = detS'detT Yl(Dj/q) has no zeros on T and 
q has no zeros on T. So, Q has the same Smith Normal form D in R 
by the gcd characterization of the Smith Normal form. 

Now, H 2 Q(H 2 nUH 2 ) is isomorphic as a vector space to the quotient 
H 2 /(H 2 n UH 2 ), and since TH 2 = H 2 = SH 2 we see that 

H 2 /(H 2 n UH 2 ) H 2 /(H 2 n -DH 2 ) 

Q 

which breaks up into the algebraic direct sum of the spaces 

H 2 /(H 2 n^H 2 ). 

q 

Any zeros of Dj or q in C \ D can be absorbed into H 2 so that ^-H 2 = 
—if 2 for some oL and a 
canceling). The space 



^H 2 for some dj and qj with all zeros in D and no common zeros (after 



H 2 n ^H 2 = d,H 2 

and H 2 /djH 2 has dimension equal to the number of zeros of dj in D. 
Therefore, H 2 /(H 2 n UH 2 ) has dimension equal to 

N 

Ni = # zeros of ^ in D counting multiplicity. 

5=1 

A similar analysis shows that UH 2 / (H 2 (1UH 2 ) has dimension equal 

to 

TV 

= # zeros of qj in D counting multiplicity. 

3=1 
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If • • • , /jvi} is an orthonormal basis for H 2 (H 2 fl UH 2 ) and 
{gi, . . . , (7at 2 } is an orthonormal basis for UH 2 (H 2 fl UH 2 ) then 

k(z; o = j2 fjwmr - E &(^(cr 

which if we form an iV x iVi matrix F — . . . , /jy-J and an iV x JV 2 
matrix G = (</x , • • • , 9n 2 ) can rewrite as 

J ~^^ (Cr = F(z)F(CT - G(z)G(0*. 

Since H 2 e(H 2 C)UH 2 ) and U H 2 Q (H 2 DU H 2 ) have trivial intersection, 
the columns of F and G form an independent set of vector functions. 

Of course, applying the above work to IP instead would yield a 
formula of the form 

I - U(()*U(z) = F{c y F{z) _ G{c y G{z) 

i - K 

after switching z and £ and taking conjugates, where now F and G are 
N\ x iV and N 2 x N valued respectively. Note we are not saying they 
are the same F and G as before, but the dimensions N\ and N 2 are 
preserved because the transpose does not change the diagonal term in 
the Smith Normal form. The rows of F and G form an independent 
set of vector functions just as above. □ 

10.1.2. A particular choice of U . Recall the matrix function U from 
Section ITon 

Pm( Z ) PlO) Po0)\ (Po(z) Pl(z) ■■■ Pmiz)]' 1 

E(z) \ E(z) 



U{z) 
Note 



T T -}_n- — (v m "' P 1 M ( l ~( Pl "' Vm ) E X \ 

~ Po Po\ E 0){0 poE-i ) 

where Q has entries in R since det E may have zeros on T while po(z) = 
p(z, 0) has no zeros on T. We now show how to compute the Smith 
Normal form of Q in R. 
Now, 



Q 



1 0\ fpm ■■■ Pl Po\ A - (Pl ■■■ Pm)\ fl 

E J 1 / J \ p / M 



It is not hard to see that the product of the inner two matrices can be 
converted to the diagonal matrix D with entries l,po, . . . ,Po,PoPo using 
row and column operations in R, which is the Smith Normal form of 
Q. The entries of are then l/po, l,Po- 
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This proves that for n 2 equal to the number of zeros of po in D and 
rii = n — n 2 

i - u(cyu(z) = F{cyF{z) _ G{CTG{z) 

1 - Cz 

where F is rt\ x (m + 1) and G is n 2 x (m + 1). 

10.2. Generalized distinguished varieties and determinantal rep- 
resentations. Distinguished varieties are a class of curves introduced 
in Agler-McCarthy [2] because they play a natural role in multivariable 
operator theory and function theory on the bidisk (see [T7j, [T5], PQ). 
The zero set Z p of a polynomial p G C[z, w] is a distinguished variety 
if 

Z p C D 2 U T 2 U E 2 

where E = C \ D. Notice the curve Z p fl D 2 exits the boundary of D 2 
through the distinguished boundary T 2 ; hence the name distinguished 
variety. This area is part of a larger topic of understanding algebraic 
curves and their interaction with T 2 . See [3], [3]. 

The sums of squares theorem, Theorem 12.41 or Theorem 110.11 nat- 
urally leads to the study of a more general class of curves using the 
methods of [17]. We say that the zero set Z p of p G C[z, w] is a gener- 
alized distinguished variety if it satisfies 

(10.10) Z p C (D x C) UT 2 U (E x C) or 

Z p C (C x D) UT 2 U (C x E). 

That is, Z p does not intersect the (relatively small) set (T x D) U (T x E) 
in the former case above. We shall show that generalized distinguished 
varieties share much of the structure of distinguished varieties, and in 
particular they possess a determinantal representation generalizing one 
of the main theorems in [2]. We use the notation ( 110. 6p below. 

Theorem 10.5. Suppose p G C[z, io] has degree (n,m), has no factors 
involving z alone, and satisfies ( 110. 10p . Then, there exists an (m + 
n) x (m + n) unitary matrix U such that p is a constant multiple of 

det (UA(z,w) -T(z,w)), 

where ra 2 is the number of zeros of p(z, 0) in D and n\ = n — n 2 . 

If Z p is a distinguished variety then n 2 = n and we get the represen- 
tation 

Z r ={ M : det (u(^ »))=„ 
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'A B 

If we write U = [ „ ^ ) , then the above zero set can be written as 



C D. 

det($H - zl n ) = 

in terms of the matrix rational inner function 

= D + wC(I - wA)- 1 B 

at least outside of the poles of $. This is how the characterization of 
distinguished varieties is stated in [2]. 

Lemma 10.6. Suppose p G C[z, w] has degree (n,m), has no factors 
involving z alone, and satisfies (110.101) . Then, p = fip for some /i£T. 

Proof. For each zeT, p(z, •) has all zeros in T — as does p(z, •). Since 
z G T, we see that p(z, •) and p(z, •) have the same roots (counting 
multiplicity, since they approach zero at the same rate near a root 
because \p\ = \p\ on T 2 ). Therefore, if we write 

m m 
p(z,w) = ^2p j (z)w : >, p( Z i W ) = ^2Pm-j(z)w J 

j=0 j=0 

then for all z 6 T, p m (z)p(z, ■) = p (z)p(z, •) since these polynomials 
have the same roots and same leading coefficient. Hence, p m (z)p(z, w) = 
p Q (z)p(z, w) for all (z, w) G C 2 . By assumption p has no factors involv- 
ing z alone, and therefore p divides p. Similarly p divides p, so that 
p = Cp for some constant C. Since \p\ = \p\ on T 2 , C must be uni- 
modular. □ 

Because of this lemma we can assume p = p by replacing p with an 
appropriate constant multiple. 

Lemma 10.7. Suppose p = p G C[z,u>] has degree (n,m) satisfies 
(110.101) and is irreducible. Then, 

• mp = + w^r- and 

aw aw 

• has no zeros in T x © and no factors in common with -f 2 -. 

aw J aw 

We reflect dp/dw at the degree (n, m — 1). 

<— 

Proof. The identity mp = + is straightforward assuming p = p. 
For t < 1, let Pt(z, w) = p(z, tw). Then, p t has no zeros inTxP and 

\p t (z,w)\ 2 -\p t (z,w)\ 2 >0 

for (z, w) G T x D. Therefore, 

r \Vt{z,w)\ 2 - \pt(z,w)\ 2 
t/i 1 - t 2 
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but the above limit equals 

m\p(z, w)\ 2 — 2Re(u;— — p(z, w)) > 
dw 

<— 

for (z, wjeTxD. Now, since mp = J| + 

21 l M2 o B /- 9 P~7 v\ i^P|2 i 9 P|2^ n 

m p(z, u>) — Zmiieiw—— p[z, w)) = -— -10— > 0. 

dw dw dw 

<— 

Therefore, any zero of in T x D is a zero of w^ and hence will be 

<— 

a zero of p, which by assumption has no zeros in T x D. So, J-^ has no 
zeros in T x D. 

Now, -f^ can have no factors in common with else w^- and 

' aw aw ' otu 

p have a common factor. As p is assumed to be irreducible, this is 
impossible. □ 

Proof of Theorem \10.5\ It is sufficient to prove the theorem for irre- 
ducible p = p since we can write the determinantal representation in 
terms of blocks corresponding to each irreducible factor of p. With 

this assumption has no zeros onTxD and no factors in common 

with w^, and mp(z, 0) = J^(z,0) has n 2 zeros in D. The proof of 
Theorem 110.11 says there are vector polynomials A e C m [z, w], B G 
C ni [z,w],C G C n2 [z,w] such that 



dp dp _ dp dp 

^-(z,w)—(C,v) -wr]—(z,w) — (£77) 

equals 

(1 - wt7)A(C, 77)M(z, w) + (1 - z()(B((, rj)*B(z, w) - C(C, r/)*C^, «;)). 

By Theorem 1 1 . 1 1 we can choose A, B, C so that A has degree at most 
(n, m — 1) while B,C have degree at most (n — l,m). Furthermore, 
the entries of B and C together form a linearly independent set of 
polynomials. 

The identity mp = + w^ proves 



2 dp dp dp dp _ dp dp 

m pp- m(w—p) - mpr]— = — wrj— — . 

dw dw dw dw dw dw 

On the zero set Z p we get the formula 
= (l-wr ] )A(C,r ] yA(z,w) + (l-zO(B(C,vrB(z,w)-C(C,vrC(z,w)). 
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A lurking isometry argument now produces the formulas we want. 
First, we rearrange 

wfjA{(, r))*A(z, w) + z{B((, rj)*B(z, w) + C((, r))*C(z, w) 
= A(C, V )*A(z, w) + B((, V )*B(z, w) + z(C((, V )*C(z, w) 

for (z,w), (C, 7 ?) £ Z p . Then, the map 



/ wA(z, w)y 




(A(z,w)\ 


1 zB(z,w) 


hi 


B{z,w) 


\C(z,w) J 




[zC(z,w) 



extends to a well-defined unitary on the span of the elements on the 
left (as (z, w) varies over Z p ) to the span of the elements on the right. 
Since the ambient spaces have the same dimension we can extend to 
an (m + n) x (m + n) unitary U such that 





( wA(z, w)) 




(A(z,w)\ 




zB(z, w) 


\-\ 






{ C(z,w) J 




[zC(z,w)l 



on Z p . Then, 

(UA(z,w) -T(z,w)) ( B I = 

w 

and since A, B, C vanish at only finitely many points in Z p , we get 

(10.11) det{UA(z,w) -T(z,w)) = 0. 

The polynomial on the left has degree less than or equal to that of p 
and vanishes on Z p . Since p is irreducible, it must either be a nonzero 
multiple of p or it must be identically zero. 

Claim: The determinant in (110. lip is not identically zero. 

The explanation is similar to before. If the determinant is identically 
zero, then by simple matrix manipulations 



U \ = 0. 




The coefficient of w m is 



zl 



det(( Q ni Z - XJ )-U 4 ) = 



l>2 



where U4 is the lower-right n x n block of U. The above determinant 
cannot vanish for any z = ( e T, since if it does there exists a non-zero 
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vector v = (vx,v 2 ) such that 

(CVl,C*>2) = (Vl,V 2 )U 4 . 

Then, on Z p 

(wA(z,w)\ /wA(z,w)\ / A(z,w)\ 

zB{z,w) = (0, Vl ,v 2 )U zB(z,w) = (0,vi,v 2 ) B{z,w) 
C(z,w) J \C(z,w) ) \zC(z,w)J 

and so (zviB(z, w)+(v 2 C(z, w) = v 1 B(z, w)+zv 2 C(z, w) which implies 

(C* - l)v!B(z, w) + (C- z)v 2 C{z, w) = 

which in turn implies 

(viB(z, w) — v 2 C(z, w) = on Z p 

since z = ( for finitely many (z, w) G Z p . Since p is irreducible, p 
divides C,v\B — v 2 C. Since B and C have degree at most (n — l,m), 
we see that (viB — v 2 C = 0, which is not possible unless V\ and v 2 are 
zero vectors, which they are not. 

So, the determinant in (110.111) is not identically zero. It follows that 
p is a multiple of the determinant in (110.111) . 

□ 
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